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Abstract — Adiabatic quantum programming defines tlie time- 
dependent mapping of a quantum algorithm into an underlying 
hardware or logical fabric. An essential step is embedding 
problem-specific information into the quantum logical fabric. 
We present algorithms for embedding arbitrary instances of the 
adiabatic quantum optimization algorithm into a square lattice 
of specialized unit cells. These methods extend with fabric growth 
while scaling linearly in time and quadratically in footprint. We 
also provide methods for handling hard faults in the logical 
fabric without invoking approximations to the original problem, 
and illustrate their versatility through numerical studies of 
embeddabilty versus fault rates in square lattices of complete 
bipartite unit cells. The studies show these algorithms are more 
resilient to faulty fabrics than naive embedding approaches, a 
feature which should prove useful in benchmarking the adiabatic 
quantum optimization algorithm on existing faulty hardware. 

Index Terms — quantum computing, adiabatic quantum opti- 
mization, graph embedding, fault-tolerant computing 



I. Introduction 

Adiabatic quantum optimization (AQO) applies the princi- 
ples of quantum computing to solve unconstrained optimiza- 
tion problems. In particular, the AQO algorithm interpolates 
between two quantum logical Hamiltonians in order to adia- 
batically transform an initial quantum state to a computational 
solution state [16l. This specialized application of adiabatic 
quantum computing has been used to solve a variety of 
problems including, for example, instances of satisfiability 
(SAT) 1 15 1 and exact cover fT6\, finding Ramsey numbers 
|[r8|, classifying binary images [21|, training classifiers for 
machine learning ||23l and finding the lowest free-energy 
configuration in folded proteins 1221 . 

Benchmarking the efficiency of the AQO algorithm is 
currently of significant interest in quantum computer science. 
Whereas some studies of optimization problems have uncov- 
ered runtimes that scale polynomially in problem size, others 
suggest worst-case exponential behavior, or even trapping in 
local minima (2\. Interpreting these analyses are difficult, 
in part, because of the manner in which instance-specific 
information alters the implementation of the algorithm, i.e., 
programming. As emphasized by others fTP], [121, fTS], 
choices made in programming the AQO algorithm greatly 
impact its runtime and, consequently, the observed scaling 
behavior 
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Benchmarking adiabatic algorithms is further complicated 
when the design of the logical Hamiltonians is constrained. 
Because the AQO algorithm uses a reduction of the classical 
optimization problem to a quantum logical representation, 
i.e., a Hamiltonian, any constraints placed on this underlying 
logical fabric can only limit performance. Understanding the 
impact of the logical fabric is especially pertinent since 
existing AQO hardware supports a specific topology over a 
relatively modest number of qubits lfT4l . lfT9l . 

Adiabatic quantum programming has been described pre- 
viously as requiring two steps: parameter setting |9| and 
minor embedding ||T0]| . Minor embedding, in particular, uses 
explicit information about the logical fabric as well as the 
problem to generate the implementation of the AQO algorithm. 
Choi has demonstrated how an arbitrary input graph can be 
minor embedded within one type of highly regular fabric, a 
square lattice of K4,4S, complete bipartite graphs with eight 
vertices. In the current paper, we also present algorithms 
for minor embedding into additional logical fabrics, namely, 
square lattices of Kc.c with c > 1. We present an attempt at 
a brute force embedding via graph isomorphism in maximal 
minors (henceforth called maximal minor embedding) as well 
as an algorithm for complete-graph embedding. We compare 
these algorithms in terms of their complexity as well as the 
scaling of the embedding result. 

Notwithstanding algorithms for the unit-cell lattice, an open 
question in adiabatic quantum programming is how to handle 
fabrics containing randomized hard faults. Hard faults refer to 
defects in the logical fabric that compromise its regularity. As 
their locations are random, the embedding algorithm must han- 
dle a variety of target graphs. In the current paper, we present 
methods for minor embedding that use heuristics to adapt to 
random faults in the logical fabric (hardware). We analyze al- 
gorithmic performance in terms of the maximum embeddable 
complete graph obtained using numerical simulations. These 
studies quantify the impact of faults on the required logical 
footprint and provide performance expectations for hard fault- 
tolerant adiabatic quantum programming. 

The paper is organized as follow: Sec. II defines the role of 
minor embedding in adiabatic quantum optimization; Sec. Ill 
briefly reviews previous work; Sec IV defines nomenclature 
and presents implications of treewidth on graph embeddability; 
Sec. V recounts properties of the unit-cell lattice; Sec. VI 
determines treewidth for hardware graphs F; Sec. VII presents 
embedding of a complete graph in F; Sec. VIII presents two 
algorithms for embedding with hard faults and numerical tests 
of these algorithms using randomized fault placement; finally. 
Sec. IX presents our conclusions. 
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II. Adiabatic Quantum Optimization 

The AQO algorithm is based on the reduction of an uncon- 
strained optimization problem to a quantum logical Hamil- 
tonian that is diagonal in the computational basis |16|. The 
reduction most naturally begins in terms of binary variables 
that can then be mapped to the qubits of a logical Hamiltonian 
Hp. For AQO, the problem Hamiltonian takes the form 

where is the weight on the z-th qubit, /3i j is the coupling 
between qubits i and j, and the sets Vp and Ep denote the 
vertices and edges of the graph F describing the logical fabric; 
a more formal definition of the hardware graph is found in 
Sec. HV] In this setting, the Pauli Zi operator defines the 
computational basis for the i-th qubit. 

The 2-local form of Eq. ([T]i restricts the optimization 
problems that can be mapped directly into Hp. Specifically, 
any binary optimization problem can be recast to have at most 
quadratic interactions, i.e., as a quadratic unconstrained binary 
optimization (QUBO) problem. This reduction can be done 
by, e.g., substituting the product of two variables with a new 
one and adding a corresponding penalty term |8|. The AQO 
program input is therefore defined as the QUBO problem 

arg min x'^Px, (2) 

xGB" 

where x is a vector of n binary variables and P is an n-by-rt 
symmetric real-valued matrix. 

In programming the QUBO problem, the interactions be- 
tween variables represented by P must be mapped into the 
quantum logical fabric. We interpret P as a weighted version of 
the adjacency matrix of an input (problem) graph P describing 
these dependencies. Hence, programming the AQO algorithm 
requires embedding P in the graph F representing the logical 
fabric. We defer the formal definition of minor embedding 
to Sec. II. B, but it suffices to say that this yields a graph 
F* — {V*,E*) contained within the logical fabric, over which 
a Hamiltonian Hp» is defined as 

= ^ /3-,Z.Z, (3) 

i€V' {i,j)eE' 

with a* and f3* ^ the corresponding weights and couplings. 
Setting these parameters requires both the matrix P and the 
embedding into the logical fabric specified by F* |9|. 

The program for the AQO algorithm is then expressed by 
the time-dependent Hamiltonian 

H{t-T) = A{t-T)Hi + B{t:T)Hp., (4) 

where A{t) and B{t) control the time-dependent interpolation 
between an initial Hamiltonian Hi and the final embedded 
problem Hamiltonian Hp* . The time T represents the anneal- 
ing time of the algorithm, such that H{T) — Hp*. Running 
the program H{t) requires initializing the quantum register 
state to be a ground state of H{0). This is followed by 
annealing to the time T after which the register is measured. 
Provided the conditions of the adiabatic theorem are met, 
the state of the register at T will be a ground state of 



Hp* and a solution to the QUBO problem. In order to meet 
these conditions, T must scale inversely with the minimum 
spectral gap of H{t) liTSl . The gap, of course, depends on 
the programmed implementation and we may expect that the 
choice of embedding plays a role in satisfying this condition. 

III. Previous Related Work 

In ifTOl . Choi described a hardware graph for minor em- 
bedding a large clique, in a limited number of qubits. 
This layout was called TRIAD. Choi also discussed using 
the TRIAD scheme on a 128 qubit hardware made up of a 
4x4 grid of ^^44 cells to achieve the embedding of Kit. 
We note the figure in ifTOl corresponding to this description in 
that paper only embeds a Kiq, but it is possible to embed Kn 
using the TRIAD scheme. It is also worth noting that while 
the text claimed a requirement of only 6 physical vertices 
for each logical qubit, this is not achievable with the given 
hardware (and is not realized in the example given). Our work 
results in the same embedding for Ku on the 4x4 grid, 
but then extends the algorithm to work on a large family 
of related logical fabrics. We also provide a straightforward 
algorithm for extending an embedding from an n x n grid to 
an (n+ 1) X (?t.+ 1) grid of Kc.c cells. This paper additionally 
determines the treewidth of the family of fabric graphs, which 
enables better screening of QUBOs for feasible embeddability. 
Perhaps most importantly, prior work did not consider the case 
of faulty fabric, which we address with two algorithms and a 
set of simulations to demonstrate performance. 

IV. Graph Minors and Tree-decomposition 

A graph G = {V, E) is a set of vertices V and a set of 
edges E formed by unordered pairs of vertices. In this paper, 
all graphs are finite, simple (no loops or multiple edges), and 
undirected. A graph H = {W,F) \s a. subgraph of G, denoted 
H <Z G, if W QV wd F <Z E. 

A path in G — {V, E) is a sequence of vertices 
wi, W2, ■ • ■ , Wfc such that for 1 <i <k, {vi,Vi+i) G E. A cycle 
is a path where vi = v^. If there are no repeated vertices, the 
path (cycle) is a simple path (cycle). 

A graph is connected if there is a path from u to w for every 
pair of distinct vertices u,v inV . A tree is a connected graph 
which does not contain any simple cycles as subgraphs. We 
say a graph is a subtree of G if H C G and H is a tree. 

Programming adiabatic quantum computing hardware to 
solve a specific problem requires embedding a problem graph 
P — [Vp, Ep) representing the QUBO problem (elements of 
Vp correspond to QUBO variables and Ep — Pij 7^ 

0}) into a hardware graph F = [Vp^Ep) whose vertices 
representing the qubits and edges are determined by couplings 
in the logical fabric. In some cases, this can be done in a one- 
to-one manner through subgraph embedding. 

Definition 1: A subgraph embedding of P into is a 
mapping f : Vp ^ Vp such that: 

• each vertex in Vp is mapped to a unique vertex in Vp. 

. if e Ep, then {f{u)J{v)) G Ep. 

Note that if such an / exists, P is a subgraph of F, P C F. 
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However, due to design constraints on the underlying logical 
fabric, in order to consider a large class of QUBO problems, 
P will need to be embedded into F as a minor 

Definition 2: A minor embedding of P in F is defined by 
a mapping cf) : Vp ^ Vp such that: 

• each vertex v in Vp is mapped to the vertex set of a 
connected subtree of F. 

• if {u,v) G Ep, then there exist G Vp such that 

iu G Tu, iv G Tt,, and {i 

If such a mapping exists, then P is minor-embeddable in F 
or P is a minor of P, written P <„i F. 

Equivalently, P is minor-embeddable in P if P can be 
obtained from P by a series of edge deletions and contractions 
(see [13 1 for more information on graph minors). Note that 
every subgraph embedding is also a minor embedding (since 
f{v) is a single node subtree of P). Furthermore, the property 
of being a minor is transitive: G <m F and P G implies 
P<m P. 

Closely related to the idea of a graph minor is the concept 
of a tree decomposition, a combinatorial way of measuring 
how "tree-like" a graph is. Many early results on graph minors 
were first proved for trees 1131 . Additionally, certain problems 
which have exponential complexity on arbitrary graphs have 
been shown to have polynomial complexity on graphs of 
bounded treewidth. More importantly, certain properties of 
tree decompositions, including upper bounds on treewidth (the 
definition of which can be found below), are closed under 
the taking of minors. Understanding the tree decomposition of 
the hardware graph gives us information about the properties 
of the minors the graph has and, thus, what sort of QUBO 
problems can be embedded. 

Definition 3: Given a graph G = {V, E) let T = (/, D) be 
a tree, and V — {Vt}iei be a family of vertex sets (also called 
bags) with Vi ^ V indexed by the elements of I. The pair 
(T, V) forms a tree decomposition of G if the following hold: 

1) V = U,eiV,. 

2) if {u, v) e E, then there exists i e / such that {u, v} C 

3) for ii, 12,^3 e I, if *3 lies on the path in T between 
ii and ^2, then V^j CiVi^Q Vi^. Equivalently, for any 
vertex v V, {i : v G Vi} forms a connected subtree 
of T. 

To avoid confusion, the elements of V are referred to as 
the vertices of G and the elements of / as the nodes of 
T. The width of a tree decomposition (P, V) is given by 
maxig/{|Vi| — 1}. The treewidth t{G) of a graph G is the 
minimum width over all tree decompositions of G. Note that 
the width of any tree decomposition of G gives an upper 
bound on t(G). The following lemmas are well-known in 
graph theory and are useful for using treewidth to analyze 
the quantum hardware graphs described in Sec. W\ 

Lemma 1: If iJ is a minor of G (i.e. H is minor- 
embeddable in G), then t{H) < t{G). 




Fig. \. A 4x4 array of A'4 4 unit cells coupled as in the hardware graph 
from fT9l . 

Thus, given the treewidth of a logical fabric P, it is possible 
to automatically narrow down the class of QUBO problems for 
which it may be possible to find an embedding. The treewidth 
of several classic families of graphs is known exactly: 

Lemma 2: Let Kn be the complete graph on n vertices and 
Kn^n the complete bipartite graph on 2n vertices. 

1) t(X„) =n-\. 

2) T{Kn.n) = n. 

3) The treewidth of an n x m 2-D planar grid is given by 
min{m, n}. 

For more information on tree decomposition and graph 
minors (including the proofs of the above lemmas) see |i5|, 
chapter 12 of [13J, and O?). 

V. Description of Hardware graph 

In this section, we review the hardware graph that has been 
the basis for several proposed or demonstrated experimental 
studies r4l, fT4l, HSl, Ell. The building blocks of this graph 
are 8-qubit unit cells whose internal couplings form 4 |[T9l . 
Unit cells are tiled together with each qubit on the left half of 
a P'4.4 connected to its image in the cells directly above and 
below, and each qubit on the right half of the P'4.4 connected 
to its image in the cells directly to the left and right. A 
representation of the graph formed by sixteen cells is shown 
in Fig. [U Note that due to the way the qubits are physically 
connected lfT9l . when there is a failure, it will be the failure of 
a qubit and not an individual coupler. In terms of the hardware 
graph, this means vertices (and all their adjacent edges) will 
fail, not individual edges. 

In our analysis, we consider extensions of the unit cell 
design to include an increase in the number of qubits forming 
a cell. We also parameterize the hardware fabric to allow 
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for expanding the grid of unit cells. In general, our results 
are applicable in the setting where cells consist of 2c qubits 
forming a Kc.c and are attached to form an m x m grid in 
the same manner as described above. We denote a hardware 
graph of this form as F{m,c). For example, the hardware 
graph shown in Fig. [T] corresponds to F(4,4). 

For ease of reference, we define a labeling on Vp(m.c) - First, 
we number a single cell: the vertices on the left half of the 
Kc^c as 1,2, ... ,c from top to bottom, and the vertices on the 
right half of the Kc^c as c + l,c + 2, . . . ,2c, again from top 
to bottom. See Fig. |5(a)| for an example of this numbering in 
a K4 4 cell. Each vertex in VF(r?j.c) is then given a label of 
the form ^ where (a, b) is the (row, column) position of the 
cell containing the vertex in the m x m grid - with cell (1,1) 
in the upper left corner - and d corresponds to the position of 
the vertex inside the individual cell, as described above. 

VI. Treewidth of the Hardware graph 

As seen in Lemma [T] if the treewidth of the hardware graph 
is known, it can be used to a priori rule out the possibility of 
embedding certain classes of QUBOs. 

However, in general, determining the treewidth of an 
arbitrary graph G is NP-complete [^, fl]. In |6|, Bodlaender 
describes a linear time algorithm to determine whether a 
graph has treewidth at most k, for a given fixed k. However, 
the constants for the algorithm are extremely large (and 
grow exponentially with fc), making it impractical for most 
graphs, including the hardware graphs of interest here. Amir 
describes a polynomial-time algorithm which finds a factor- 
0{\og{T{G))) approximation of the treewidth of a graph G 
||3j|, however we have tighter bounds for the treewidth of the 
hardware graph F{m,c), as presented below. 

Theorem 1: Let Fijn, c) be a hardware graph made up of 
an m X m array of cells, attached as described in Sec. |V] 
where each cell contains 2c qubits connected to form a K^^c- 
Then, 

1) the treewidth of a single cell (to = 1) is c. 

2) cm < T{F(rn, c)) < cm + c — 1 for to > 2. 

Corollary 1: Any QUBO problem P of treewidth 
t(P) > cm + c is not minor embeddable in the hardware 
graph F{m, c). 

Corollary 2: Any QUBO problem which contains a 
Kcm+c+i (either as a subgraph or as a minor) cannot be 
embedded into the hardware graph F{m, c). 

Corollary 3: Any QUBO problem which contains a 
c(to + 1) X c(m + 1) grid (either as a subgraph or as a minor) 
cannot be embedded into the hardware graph F{m, c). 

Thus, even though the hardware graph described in 
Corollary |2] contains 2cto^ qubits, a Kcm+c+i, which would 
need only c(to + 1) + 1 logical qubits (if they were all 
coupled in the fabric), is shown to not be embeddable, due to 
its treewidth. 



Proof of Thin. |7} The proof of (1) follows directly from 
Lemma |2] Furthermore, the lower bound of (2) follows from 
using the algorithm in Sec. |VII-B] to embed a Kcm+i into G, 
since by Lemma |2] T{Kcm+i) = cto, and Lemma [T] implies 

cm ^ t{Kc„i+i) < T{F{m, c)). 

The upper bound is slightly harder to compute. The proof 
consists of constructing a tree decomposition of F{m, c) with 
width cm + c — 1. Then, since the treewidth of F{m, c) is the 
minimum width over all tree decompositions, cm + c — 1 is 
an upper bound. 

To form a tree decomposition (T, V) of width 
CTO + c - 1, we start with Vi = {vi'\^,v'iV, ■ • • , 

c+2 2c c+1 c+2 2c '1 2 c' 1 

"1.2 7 • ■ • 1 '^1,21 '^l^mi "^l^mi ■ • ■ J "l,rm '^l.lJ '^1,1? • ■ • : "^l,!/- 

That is, Vi contains the right half of every cell in the first 
column of the grid plus the left half of the (1,1) cell. 

The idea is to create all other bags of the decomposition by 
sequentially dropping/adding the left/right halves of individual 
cells. Each new bag will be formed by removing one of 
these sets of four vertices from an existing bag, and adding a 
(different) set of four - specifically one that is not yet contained 
in any existing bag. The large amount of overlap between 
the bags is to ensure that the third requirement of Def. [3] is 
satisfied. 

The bags V2, . . . , Vm of the decomposition are formed by 
dropping the right sides of cells in the first column and 
picking up the left sides, one-by-one. That is, Vi contains 
the right half of cells i + 1 through m in the first column, 
the left half of cells 1 through i — 1, and all of cell i. 
More formally, for 2 < i < to, Vi = {v^\^ , . . . ,vf'i, 

• ■ • ' ^rrtl ' ■ • • ' ^m,li ■ ■ ■ J ^IAt ■ ■ : ■ ■ ■ J "'^i^,! J • • ■ : ^lU }■ 

the tree being formed, T, the first to nodes form a path. 

The next to bags are formed by (again) starting with Vi but 
adding the right hand sides of the cells in the second column: 
for Vm+i we drop the remaining four vertices in the left half 
of the first column and add the top four in the right half of 
the second; for Vm+i with 2 < i < m, we add ^2^^, ■ • ■ , 
and remove Wj^li, . . . , Bag V2m+i is then formed by 

dropping the last four vertices from the first column and adding 
the four left vertices of the top cell in the second column. Note 
that V2m+i is the exact same "shape" as Vi, only one column 
over There is an edge between node 1 and node m + 1 in T, 
then nodes m + 2 through 2m continue the path. 

At this point, the tree decomposition branches, with two new 
bags attached to V2m+i (analogous to Vi). The first is V2m+2, 
which starts the branch consisting of V2m+2, ■ ■ ■ , Vsm, with 
V2m+i dropping ■ • ■ and adding v^^,...v^^. 

Note this is equivalent to how Vi, . . . , Vm were created. 
Also attached to V2m+i is Vsm+i, formed by removing the 
four righthand vertices from the top cell and adding the 
top four vertices from the right half of the third column. 
This branch continues to form Vzm+2 , ■ ■ ■ , Vim analogously 
to Vm+2i ■ ■ ■ , V2m, SO that Vim has the same shape as V2m, 
only one column over 

The remainder of the tree decomposition is created starting 
from Vim+i (formed analogously to V2m+i), until each col- 
umn has been covered with a set of bags which are formed 
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like Vi, . . . ,Vm. This generates a total of 2m^ — m bags, 
each containing exactly cm + c vertices of F{m, c). A small 
example of the beginning of this process on a 3 x 3 grid of 
_ft'4 4 cells can be seen in Fig. |2] The tree associated with this 
tree decomposition can be found in Fig. |3(b)| along with the 
trees associated with the tree decompositions of the 2x2 and 
the 4x4 grids in Fig. |3(a)| and |3(c)| respectively. Note that 
these three trees have the same general shape, with only the 
length of their branches changing, dependent on m. 

We now show why (T, V) satisfies the three properties of a 
tree decomposition from Def. [3j 



1) every vertex of F{m, c) is in at least one bag. 

2) every edge is contained in at least one bag. This can be 
verified by noticing that every cell is fully contained 
in exactly one bag, covering all edges within Kc^c- 
Additionally, for each column, there is a bag containing 
all of the left side vertices of the cells in the column, 
and thus all the vertical intercell edges in the column. 
Finally, as the bags move from one column to the next, 
the right halves of each pair of horizontally adjacent 
cells are contained in a unique bag, thus covering all 
horizontal intercell edges. 
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Fig. 3. Trees of the tree decompositions of the hardware graph with a grid 
of size (a) 2 X 2, (b) 3 X 3, and (c) 4 X 4 which satisfy the upper bound from 
Thm.n 

3) Let V be an arbitrary vertex in F{m, c) and let Vk be the 
lowest index bag in which v appears. Then, as we walk 
along T starting at node k and traveling in the direction 
of increasing node labels, once v is dropped from the 
bag (on any branch) it is never picked up again. Thus, 
the nodes of T which correspond to bags that contain v 
form a connected subtree of T. 

Since (T, V) is a tree decomposition of F{m, c) where every 
bag contains cm + c vertices, it has width cm + c — 1, so 

T{F{'m, c)) < cm + c — 1. 

■ 

While these bounds are not tight for all choices of c, they 
are best possible when c — 1, as cm — cm + c — 1 = m. 

Determining bounds on the treewidth of the hardware graph 
is useful because it allows us to automatically dismiss the 
possibility of embedding certain classes of QUBO problems, 
members of which we might otherwise have spent considerable 
time attempting to embed. 

If lower bounds on the treewidth of the QUBO problems 
are known, these can be combined with the bounds on the 
treewidth of the hardware graph to rule out even more QUBO 
problems. There are many graph-theoretic methods for finding 
lower bounds on treewidth, which use various graph properties 
including smallest degree, second smallest degree, girth, and 
spectral radius. Applying lower bounds to classes of QUBO 
problems is beyond the scope of this paper, but an overview 
of common lower-bound algorithms can be found in [7J. 

VII. Embedding into the Hardware graph 

In general, determining whether an arbitrary graph H can 
be minor-embedded into an arbitrary fabric F is NP-complete. 
The best-known general algorithms assume a fixed input 
graph H (Q], which is the opposite of the situation in the 
quantum programming problem. Additionally, although there 
are polynomial time recognition algorithms for the existence 
of an embedding, they do not produce the embedding and. 



in all cases, the hidden constants are prohibitively large ||6l , 
||25l . Algorithms which allow H to vary along with F are no 
longer polynomial |27| or are limited to specific classes 
of graphs which do not include the hardware graphs described 
in Sec.|V]f20l. 

A. Maximal Minor Embedding 

Given a fabric F on n vertices, the method for finding and 
embedding every possible minor-embeddable problem graph 
P involves solving an NP-complete problem. First, all the 
minors of F must be found and, second, we must determine 
whether P is a subgraph of any of them. The first step can be 
done when fabric is defined but even once all the minors are 
known, every new problem graph P must be checked against 
them for subgraph containment, which is still NP-complete on 
arbitrary inputs. 

The brute force algorithm for finding all possible minors 
of F involves finding the maximal minors: a set of minors 
of F such that every other minor is a subgraph of one of 
the maximal minors. The first maximal minor is F itself. 
Subsequent maximal minors are found by contracting an edge 
in F to form a minor and checking it for subgraph containment 
against the list of maximal minors. If it is not a subgraph of 
any of these, it is added to the list. Once every minor of size 
n — 1 is found (i.e. every possible edge contraction of F has 
been tested), the process is repeated by contracting edges in 
these minors. The process is completed at step k when no new 
maximal minors of size n — k are found. An example of a set 
of maximal minors can be found in Fig. |4] which shows the 
four distinct maximal minors of F(4,4). 

Conceptually, maximal minor embedding is very straight- 
forward. The input graph P is compared to the known list 
of maximal minors for F. However, the comparison requires 
testing for subgraph containment, which is a combinatorial in 
the number of checks that must be performed. Consequently, 




(c) Kq minus 2 edges (d) K$ 

Fig. 4. All minors of a single cell are a subgraph of one of these 4 graphs. 
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(a) embedded in a (b) labelled by hardwai'e vertices 

7^4 4 merged for each logical qubit 

Fig. 5. A Kt:, embedding into a single 7^4,4 cell of qubits. 

maximal minor embedding suffers from two distinct bottle- 
necks, i.e., finding the maximal minor and finding the embed- 
ding. Nonetheless, this method has the benefit of finding the 
optimal embedding with respect to the size of the embedded 
problem. Because smaller embedding sizes may be expected 
to contribute favorably to the scaling of the energy gap, the 
effort required must be weighed against its advantages. 

B. Algorithm to Embed Kn 

Instead of trying to find every possible minor of the hard- 
ware graph, we can find an embedding of Kcm+i- Then, for 
any QUBO problem of size cm + 1 or smaller, the embedding 
problem is solved. The downside of this approach is that it 
will fail to embed many problems that are indeed embeddable 
in the hardware. For example, although the graphs in Fig.HJb) 
and (c) are embeddable in a K^^i^ cell, they are not embeddable 
in i^s, which is the largest A'„ minor in the cell. Because of 
this, the complete-graph embedding algorithm (as described in 
Sec. IVII-BI l requires a 2 x 2 array of four cells in order to find 
an embedding for QUBO problems corresponding to either of 
those graphs. 

Unlike maximal minor embedding, the complete-graph em- 
bedding algorithm is computationally simple albeit at the cost 
of increasesd usage of the logical fabric. This illustrates that 
the two methods described here represent a tradeoff between 
the computational complexity of the embedding algorithm 
and the potential computational complexity of the quantum 
program as measured by the area of the computational fabric. 

Given a hardware graph as described in Sec. |V] our al- 
gorithm to embed Kcm+i as a minor in the m x m grid of 
Kc^c cells is recursive in nature, and constructs the mapping 
described in Def. |2] For the sake of clarity, in the description 
of the algorithm, the elements of the Kcm+i will be referred 
to as nodes and the elements of the hardware graph will be 
referred to as vertices. Let ui, U2, . . . , Wcm+i be the nodes of 
the Kcm+i that we are trying to embed. 

The algorithm begins by embedding the first c + 1 nodes 
(forming a Kc^i) into the cell in the upper left corner of the 
hardware. This is done by pairing left and right vertices c — 1 
times. 
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That is, for 1 < j < c — 1, 4>{uj) = {v-[ The 
next two nodes are each initially mapped to a set containing 
a single vertex: 4){uc) ~ {^^i i}, and 0(mc+i) = {w^'j}. See 
Fig. |5] for an example of embedding in a K^^j^ cell of 8 
qubits. We provide an instance of this algorithm in a Matlab- 
style pseudocode for a function which produces an embedding 
into non-faulty F(m, c) hardware. 

After embedding a Kc+i into the first cell of the hardware 
graph, the m — 1 remaining steps of the algorithm extend the 
embedding into the subsequent row and column of the mxm 
grid. For each step 2 < i < m, the embedding forms an 
extendable clique minor in the i x i grid. We say a minor is 
extendable if it satisfies two conditions: first for Uj, I < j < 
c{i — 1) + 1 the set 4'{uj) is non-empty. Second, each set has 
at least one vertex with an edge into the next row or column. 

For all nodes uj, at least one vertex of (l){uj) is connected 
to a cell in the next row and/or column of the grid. These 
vertices are added to the set <j>{uj). For nodes Uc and Mc+i, 
one vertex is added to ^(mc) and 0(mc+i) at each layer i. For 
all other nodes uj, two new vertices are added to <j>{uj). 

The sets 0(uc(i-i)+2) through (/'('Uc(j-i)+c+i) are formed 
by picking one of the unclaimed vertices on the right side of 
cell This is extended by following the edges from cell 

to cell along row i. When column i is reached, one edge is 
taken within the cell, then edges from cell to cell are followed 
up along column i. At the end of this process, each of these 
sets will contain 2i vertices: for 1 < s < c, (/'(Mc(i-i)+s+i) ~ 
K+^...^;^|^<l,...,<J. 

This process is continued until Kcm+i is fully embedded 
in the mxm grid. See Fig. |6] for an extension of a K13 
embedding in a 3 x 3 grid of 4 cells to a Kit embedding 



function V = no_failure_embedding (c ,m) 
% This function takes an mxm hardware graph 
% of K_{c,c} cells and outputs a (2m)x(cm+l) 
% matrix V where the non— zero entries of V(:,i) 
% are phi(u_i) for u_i in the embedded K_{cm+1} 

V = zeros (2*m, c*m+l) ; 

%Almost all the c*m— 2 sets are formed similarly 
for i = 1 : c *m+ 1 
if i < c 

%First grid row/column, position i in cell 
r = 1 ; s = i ; 
else if i>c+l 

%Calculate row/column of the grid 
r = ceil ((i-l)/c) ; 
%Calculate level within cell 
s = mod ( ( i — 1 ) , c ) ; 
if s==0 
s=c ; 

else 

continue ; %these are handled below 

%fill in the horizontal members of phi(u_i) 
for j =1 :m 

V( j , i ) =2* c*m*( r — l)+2*c*(j — l)+c+s ; 
%fill in the vertical members of phi(u_i) 
for j =1 :m 

V( j +m, i ) =2* c *( r — 1)+2* c*m*( j — l)+s ; 

end 

%At i=c and i=c + l, the sets differ , and have size m 
for j = 1 :m 

V(j ,c)= c + (j— l)*2*c*m; 
V(j , c + l)= j *2*c ; 



C. KLYMKO, B. D. SULLIVAN, AND T. S. HUMBLE 




(a) Ki3 embedded in upper 3x3 sub-grid 
Fig. 6. A Kii embedding in a 3 X 3 grid of cells extended to a K17 

in a 4 X 4 grid of cells. 

In the description of the hardware graph in Sec. [V] the 
vertices were given labels of the form f^. In the Matlab- 
style pseudocode found below, they are numbered from 1 to 
crn^. The numbering starts in the cell in the upper left corner 
as described in Fig |5(a)| and this numbering is continued across 
the row, then across subsequent rows. Given a node position 
in the form v'^ ^, the equivalent number in the code below is 
n — 2cm{a — 1) + 2c{b — 1) + d. Given a node numbered n 
in the code below, the equivalent label is given by ^ with 
a = [-n-l b = r "-2"»(°-i) i and d = n mod 2c, with 

I 2cm I ' I 2c I ' ' 

d — 2c if n mod 2c = 0. 

VIII. Embedding with failed Qubits 

The complete-graph embedding algorithm presented in Sec. 
IVII-BI assumed that there are no failures in the hardware. 
However, the hardware may exhibit some percentage of failed 
vertices which prevent a full Kcm+i embedding (e.g. in the 
case of any single qubit failure, the biggest clique embeddable 
is Kcm)- Instead of losing a node from the Kcm+i for each 
failed qubit, techniques can be employed to embed in a way 
that attempts to minimize the number of sets (j){u) which 
contain any failed qubits. 

We present two algorithms below in order to handle the case 
of fabrics with hard faults. These approaches to embedding test 
the different starting points available from the four corners of 
the m X m grid and then return the best possible embedding 
that results. Additionally, if the largest Kn found is smaller 
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(b) Ki7 embedded in the 4x4 grid 
embedding in a 4 X 4 grid. Each color represents a single logical qubit. 

than the largest possible in an (m— 1) x (m— 1) grid, from each 
corner, we drop the first row and column and reattempt the 
embedding. This "dropping down" procedure continues until 
a large enough clique is found or (m — 1) rows and columns 
have been dropped. 

At the same time, the grid is scanned and the largest Kn 
embeddable in a single cell (1 < n < c+l)is found. If 
a complete cell is found, this is Kc+i- The reported largest 
embeddable Kn output by the algorithm is the maximum of 
the largest clique embeddable inside a single cell and the four 
cliques found from starting at the four corners. 

Combining these two procedures yields a "flip and drop- 
down" method that we compare to the single, nominal attempt 
at embedding, i.e., starting in the upper left corner In all cases, 
the worst performance possible is to embed a Ki, since we 
assume there is at least one working qubit in the hardware. 
Note details of the corner selection and drop-down methods 
are not shown in the pseudocode. 

A. Dropping to a smaller cell-graph 

Given an m x to hardware graph with cells of Kc.c, one way 
to deal with failed qubits is to find the largest Co, Co < c, such 
that there is a complete m x m grid of Kc^,cJ^ and use the 
algorithm described in Section IVII-BI to embed into this sub- 
grid. This will lead to an embedding of size CoTO + I < cm + 1. 
Once the Co has been determined, the embedding can be found 
by renumbering the vertices of the hardware graph to reflect 
the new cell size and running no_failure_embedding(co, m). 
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B. Greedy failure algorithm 

As can be seen in Fig. |6l given a perfect mxm grid of Kc,c 
cells, for each node u of the embedded Kcm+i (other than 
nodes Uc and Uc+i started in the first cell), contains 2m 
vertices. These consist of two sets of m vertices: a connected 
set consisting of one vertex from the left side of each cell in 
the a single column in the grid and a connected set consisting 
of one vertex from the right side of each cell in the row 
of the same number Due to the pattern in which cells are 
connected, within both of these sets every vertex occupies the 
same position in the cell it comes from. 

The greedy failure algorithm works to maximize the size of 
the complete Kn which can be embedded in the hardware 
graph with failed vertices, by attempting to pair up sets 
containing failed vertices with other sets containing failed 
vertices to create full nodes. These "match-ups" occur in the 
diagonal cells of the grid. In the case of no failures, each 
horizontal set (of vertices from the right halves of cells) is 
matched with a vertical set (of vertices from the left halves of 
cells) whose vertices occupy the same 'height' inside a single 
cell. When there are errors, however, horizontal sets containing 
failed vertices attempt to match with vertical sets that also 
contain failed vertices, regardless of the 'heights' at which 
the vertices sit inside a cell. By matching sets which contain 
failures, the number of complete nodes (all of which except 
Uc and Uc+i are made up of two sets) containing failures is 
reduced and, consequently, a larger embedded Kn is achieved. 

The Matlab-style pseudocode for a function which produces 
the nodes of the embedding described above and outputs the 
number of nodes containing no errors can be found at right. 

C. Analysis 

A comprehensive set of experiments were run to see how 
well the fallback and greedy algorithms from Sees. IVIII-AI 
and IVIII-BI respectively, performed under various conditions 
of vertex failure. These experiments were run using a single 
attempt at embedding that begins in the upper left corner of 
the grid of cells as well as a run using the flip and drop- 
down scheme described at the beginning of Sec. IVIIII In all 
cases, the hardware graph was an to x m grid of K4^4 cells. 
The grid sizes tested were m = 4,8, 16, and 32. For each of 
these grid sizes, the algorithms were run with a percentage of 
failed vertices of p = 2, 4, 5, 6, 8, 10, 15, 20 and 25. The failed 
vertices were uniformly distributed across the hardware graph. 
In each of the 148 cases (defined by algorithm, scheme, grid 
size, and failure rate), 10,000 randomized instances were run 
to compute statistical averages. 

A comparison of the results shown by Figs. [T] and [8] 
illustrates that the flip and drop-down embedding scheme 
performs better than a single attempt at embedding from 
the upper left corner and that the greedy algorithm performs 
better than the fallback method. In both schemes, the greedy 
algorithm embeds a Kn with n approximately 85% of the 
optimum value at two percent failure rate. 



function [V,k] = greedy_embedding (c ,m,G) 
% This function takes an mxm hardware graph 
% of K_{c,c} cells and a list G of failed 
% vertices. Outputs are a (2m)x(cm+l) matrix V, 
% where non— zero entries of V(:,i) are phi(u_i) 
% for u_i in the embedded K_{cm+1}, and k is the 
% number of failure— free sets phi(u_i). 

%Helper Function: PAIR(s,t,cv) 

%stores the union of F(:,s) and F(:,t) in V(;,cv) 

%First , we form all of the half — sets in a matrix F 
F = zero s (m,2 * c*m) 
for i = 1 :m 

for pos = l:c 

%determine columns of F to be filled 

Cnum = 2* c *( i — l)+pos 

Rnum = 2*c *( i — l)+pos+c 

for j =1 :m 

%half — sets in col i of hardware graph 
F( j ,Cnum)=2*c*m*( i— l)+2*c*(j— l)+pos+c 
%half — sets in row i of hardware graph 
F( j , Rnum) =2* c*m*( j — l)+2*c *( i — l)+pos 
end %of for i = 1 :m 

% Match half — sets for each row/column to minimize 

% number of full sets containing failed vertices. 

V=zeros(2*m, c*m+l) 

cv = 1; %first open column of V 

k = 0; %number of failure— free full sets created 

for i = l:m 

Fi= 2c*(i— 1) "/boffset for column indices in F 
%Pair up sets containing failures 
for s = 1 : c 

if i==l and cv=c 

break; %go create size m sets 
if F(: ,Fi + s) contains a failure in G 
for t = l:c 

if F(:,Fi+c+t) contains a failure 
PAIR(s ,c+t ,cv) 

CV++ 

break 
end %of for s = 1 : c 

%Pair remaining half — sets arbitrarily until 
%c — 1 (i = l) or c (i>l) whole sets have been made 
for s = 1 : c 

if ((i==l and cv==c) or (cv==c*i+2)) 

break; %create size m sets or next i 

if F(: ,Fi + s) unpaired 
for t = 1 : c 

if F(: ,Fi+c + t) unpaired 
PAIR(s , c+t , cv) 

CV++ 

if V(:,cv) failure— free 
k=k+l 
end %of for s = 1 : c 

%Create two size m sets in row/column 1: 
for s = 1 : c 

if F(:,s) unpaired 

V(: ,cv) <- F(: ,s) 

CV++ 

if F(:,s) failure— free 
k=k+l 
if(F:,c+s) unpaired 

V(: ,cv) <- F(: ,c+s) 
CV++; 

if F(:,c + s) failure— free 
k=k+l 

end %of for s = 1 : c 
end %of for i =1 :m 
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(a) Single attempt at embedding (b) Maximum over flip and drop-down embeddings 

Fig. 7. Percent of the maximum embeddable Kn acliieved for both the fallback and greedy embedding schemes for various percentages of failed vertices, 
averaged over 10,000 trials. This is calculated for both a single attempt at the embedding (left) and multiple attempts at the embedding, starting in all four 
corners and, if necessary, dropping to a smaller grid (right). Both methods also search for a whole cell. 



At fixed failure rate, the percent of the maximum embed- 
dable Kn for both algorithms decreases as the grid size m 
grows. This is due to the fact that the number of hardware 
vertices mapped to a single node of the Kn minor increases 
linearly with grid size. On the 4x4 grid, each set is 
made up of 8 vertices (except for 2 special cases). Given a 
2% failure rate, this means that any (p{u) on the 4x4 grid 
(with no attempt at a 'smart' embedding scheme) has a 16% 
chance that the set contains at least one failed vertex (and thus 
can not augment the size of the Kn embedded). Similarly, on 
the 32 X 32 grid, each (j){u) contains 64 vertices, and for 2% 
failure having at least one failed vertex per cell is highly likely. 

At 2% failure rate, the greedy embedding scheme with flips 
and drop-downs achieves embedding of a complete graph of 
over 40% the size of the maximum Kn embeddable. For 
the worst case scenario, and with no attempt at a 'smart' 
embedding, it would only take one failed vertex to destroy 
each logical qubit. Even at only a 2% failure rate, the 32 x 32 
grid has on average 163 failed vertices. If the algorithm did 
not adapt, this high failure density would completely destroy 
the maximum embeddable clique, which is a Ki2g. In the case 
of a 25% failure rate, the number of failed vertices jumps to 
2048, yet the greedy failure algorithm is still able to embed a 
Kq on average. 

We have also analyzed the variances in embeddability 
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Fig. 9. Variances of fault-tolerant embedding algorithms (with flip and drop- 
down) on the 4x4 grid. 



from these experiments. In the case of a single attempt at 
embedding, the distribution of embeddable graphs tends to be 
narrower than when using the drop-down scheme. For larger 
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Fig. 10. Histograms for 10,000 trials of the fallback (left) and greedy (right) 
embeddings with flipping and drop-down on the 4x4 grid at p = 2,4, 8, 
and 10 percent failure of the nodes. 
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(a) Fallback embedding algorithm (b) Greedy embedding algorithm 

Fig. 8. Percent of the maximum embeddable Kn achieved in both a single attempt at embedding and multiple attempts (starting in each of the four corners 
and, if necessary, dropping down) for various percentages of failed vertices, averaged over 10,000 trials. This is calculated both for the fallback method (left) 
and for the greedy method (right). 



grid sizes and for higher percentages of failure, the variance 
of the single attempt falls to zero. This is caused by the fact 
that the algorithm never does better than embedding a 
into a single, complete cell. However, this happens less often 
for the drop-down embedding scheme, yielding larger average 
Kn with higher variances. An example of this behavior is 
shown in Fig. |9] for the case of F{A,A) when varying the 
percent failure rate. It is notable that while the variance of 
the fallback method is relatively large for small error rates, 
the greedy algorithm maintains a near constant, much lower 
variance across all failure rates. In Fig. (TO] the distribution 
of achieved embeddings over 10,000 trials using the flip and 
drop down scheme on F{4, 4), with the percentage of failed 
vertices at p = 2,4,8, and 10, is shown. The embeddings 
achieved by the greedy algorithm are both more clustered and 
larger than those achieved by the fallback algorithm. With the 
added evidence of panel (b) in Figs. [T] and [8] this demonstrates 
the greedy approach is more robust in the presence of hard 
faults. 

IX. Conclusions 

We have presented methods for adiabatic quantum pro- 
gramming that embed problem specific information into an 
underlying quantum logical fabric. Our methods include an 
embeddability analysis based on the treewidth of an m-by-m 
lattice of Kc^c unit cells, which is a generalization of existing 
adiabatic quantum hardware. This has provided bounds on the 
graphs that can be embedded in a predefined logical fabric and 
should be useful for guiding adiabatic quantum programmed 
implementations . 

In addition, we have presented two new methods for finding 
an embedding of a complete graph in faulty fabric. The first 
method handles failures by falling back to a set of smaller 
available unit cells, while the second searches for embeddings 
that minimize the number of affected logical qubits using 
matching within cells on the diagonals. The latter was shown 



to have greater power for programming implementations of 
arbitrary QUBO instances. Numerical studies of embeddability 
run against randomized failures further showed the relative 
robustness of the second algorithm and the remarkably smaller 
variance in embeddable graphs. 

In our study of embedding for adiabatic quantum pro- 
gramming, we have neglected any question regarding the 
subsequent computational complexity. The question of how 
a particular embedding algorithm impacts the complexity of 
the resulting AQO program is a point for future research. 
The current work, however, is expected to support uncovering 
the dependency of the computational complexity on both the 
embedding and parameter setting methods used. We believe 
that the embedding algorithms explored here, which provide 
a constructive approach to programming, will be useful for 
providing a consistent means of comparing the AQO algorithm 
across different problem sizes and hardware. 
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